Comprehensive Workload Analysis and Modeling of a Petascale Supercomputer
نویسندگان
چکیده
The performance of supercomputer schedulers is greatly affected by the characteristics of the workload it serves. A good understanding of workload characteristics is always important to develop and evaluate different scheduling strategies for an HPC system. In this paper, we present a comprehensive analysis of the workload characteristics of Kraken, the world’s fastest academic supercomputer and 11th on the latest Top500 list, with 112,896 compute cores and peak performance of 1.17 petaflops. In this study, we use twelve-month workload traces gathered on the system, which include around 700 thousand jobs submitted by more than one thousand users from 25 research areas. We investigate three categories of the workload characteristics: 1) general characteristics, including distribution of jobs over research fields and different queues, distribution of job size for an individual user, job cancellation rate, job termination rate, and walltime request accuracy; 2) temporal characteristics, including monthly machine utilization, job temporal distributions for different time periods, job inter-arrival time between temporally adjacent jobs and jobs submitted by the same user; 3) execution characteristics, including distributions of each job attribute, such as job queuing time, job actual runtime, job size, and memory usage, and the correlations between these job attributes. This work provides a realistic basis for scheduler design and comparison by studying the supercomputer’s workload with new approaches such as using Gaussian mixture model, and new viewpoints such as from the perspective of user community. To the best of our knowledge, it’s the first research to systematically investigate the workload characteristics of a petascale supercomputer that is dedicated to open scientific research.
منابع مشابه
Application Workloads on the Jaguar Cray XT5 System
In this study we investigate computational workloads for the Jaguar system during its tenure as a 2.3 petaflop system at Oak Ridge National Laboratory. The study is based on a comprehensive analysis of MOAB and ALPS job logs over this period. We consider Jaguar utilization over time, usage patterns by science domain, most heavily used applications and their usage patterns, and execution charact...
متن کاملWorkload Analysis of Blue Waters
Blue Waters is a Petascale-level supercomputer whose mission is to enable the national scientific and research community to solve"grand challenge"problems that are orders of magnitude more complex than can be carried out on other high performance computing systems. Given the important and unique role that Blue Waters plays in the U.S. research portfolio, it is important to have a detailed under...
متن کاملSystem-Wide Tradeoff Modeling of Performance, Power, and Resilience on Petascale Systems
While performance remains a major objective in the field of high-performance computing (HPC), future systems will have to deliver desired performance under both reliability and energy constraints. Although a number of resilience methods and power management techniques have been presented to address the reliability and energy concerns, the tradeoffs among performance, power, and resilience are n...
متن کاملCUDA Center of Excellence at Illinois CUDA Achievement Award Submission “Fighting HIV with CUDA Technology from the Desktop to the Petascale”
Figure 1: The first all-atom structure of the complete HIV virus capsid [1], a breakthrough enabled by CUDA technology in NAMD, VMD, and the Blue Waters supercomputer at Illinois. The first scientific breakthrough achieved with the Blue Waters supercomputer at the University of Illinois was the determination of the structure of the complete HIV capsid in atomic-level detail, a collaborative eff...
متن کاملStrong scaling analysis of a parallel, unstructured, implicit solver and the influence of the operating system interference
PHASTA falls under the category of high-performance scientific computation codes designed for solving partial differential equations (PDEs). Its a massively parallel unstructured, implicit solver with particular emphasis on fluid dynamics (CFD) applications. More specifically, PHASTA is a parallel, hierarchic, adaptive, stabilized, transient analysis code that effectively employs advanced aniso...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012